Abstract

A novel approach to protein folding dynamics is presented. We suggest that
folding of proteins may be mediated via interaction with solitons which
propagate along the molecular chain. A simple toy model is presented in which
a Sine-Gordon field interacts with another field $\phi$ corresponding to the
conformation angles of the protein. We demonstrate how a soliton carries this
field over energy barriers and consequently enhances the rate of the folding
process. The soliton compensates for its energy loss by pumping the energy
gained by the folded field. This scenario does not change significantly even in
the presence of dissipation and imposed disorder.



The possible existence of solitons in biomolecules has been widely studied
since the work of Davydov, who described solitons in proteins as a means
of transferring energy and charge. It is evident today that solitonic excitations
should be included in order to properly describe the biophysics of proteins and
DNA molecules (e.g. see [?], [?], [?]), though their importance to biological
activity is still controversial.

In this paper we suggest that solitons may take part in the process of protein
folding. The way in which a protein, a linear chain of amino acids, folds into a
well-defined, three-dimensional native structure is still far from being understood,
although considerable effort is devoted to the subject. Since many biological
proteins fold correctly without assistance, it is clear that all the information
about their native state already exists in the sequence of amino acids [?]. The
folding dynamics may be viewed as a navigation process in the energy landscape
of the protein conformational space. It is believed that the native structure
corresponds to a small region in that space located in the vicinity of the ground
state. Identifying this ground state is highly nontrivial, but this is not the entire
story. The ground state should be accessible dynamically within time scales not
larger than minutes. As pointed out by Levinthal, a stochastic search for the ground
state over a random landscape might take cosmological time. From this fact one
may draw the conclusion that the landscape of natural proteins is not random,
but, rather, directed towards the ground state [?]. Conventionally, overcoming the
energetic barriers is ascribed to stochastic and uncorrelated thermal activation.
Here we shall explore another option. Surely, random thermal processes have
an important role in folding dynamics, but it may be that other processes, correlated
and deterministic, dictate the folding pathway. To illustrate what is meant by
this, suppose that energy gained at a point where the molecule has already reached
the lower energy state could propagate efficiently to another point along the
molecule and induce a subsequent fold there. This may go on further, as more
energy is now released. In this way, large sections of a protein can fold very fast,
and moreover, the folding process can be orchestrated deterministically. Solitons
provide a mechanism for such stable, nondispersive energy transfer. We shall
refer to a folding process induced by the propagation of a solitonic wave along
the molecular chain as Soliton Mediated Folding (SMF).

In order to explore the possibilities of soliton interaction with local conformation
angles of the molecular chain, we introduce a toy model inspired by the essential
properties of protein molecules. Proteins have two local angles per residue
(usually denoted $\phi$ and $\psi$), which are relatively free, and may be considered as the
only relevant degrees of freedom. For almost all amino acid residues in a polypeptide
chain there are two distinct minima of the local potential energy in the ($\phi$, $\psi$)
plane, corresponding to the $\alpha$ and the $\beta$ local conformations of the chain. The
only exceptions are glycine, which has four minima, and proline, which has only one.
We shall use a scalar variable $\phi$ to represent the local conformation of the protein.
The local potential energy will be simply modeled by an asymmetric $\phi^4$
double-well potential, namely

$$V(\phi) = \epsilon(\phi+\delta)^2\left(\phi^2 - \tfrac{2}{3}\delta\phi + \tfrac{1}{3}\delta^2 - 2\right), \tag{1}$$

where $\delta$ is the asymmetry parameter, ranging from $-1$ to $1$. The two minima are
positioned at $\phi = \pm 1$. The energy difference between the minima is

$$\Delta E = V(-1) - V(1) = \frac{16}{3}\epsilon\delta. \tag{2}$$

The maximum is positioned at $\phi = -\delta$ and its energy is always zero. We shall
denote by $\theta(x)$ an additional field ($x$ is the position along the protein), for which
solitonic excitations can be realized. Excluding the interaction with $\phi$, we choose
its dynamics to be governed by the Sine-Gordon (SG) equation. It is convenient to
picture $\theta$ as an integral over a charge density field. Indeed, it was suggested
that the dynamics of a charge in an ordered linear medium is governed by the
SG equation [?]. Nevertheless, many nonlinear phenomena can be described
using equations which allow solitonic excitations, and the SG dynamics was chosen
mainly for practical reasons.
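The stated properties of the local potential can be checked numerically. The sketch below assumes the form $V(\phi) = \epsilon(\phi+\delta)^2(\phi^2 - \tfrac{2}{3}\delta\phi + \tfrac{1}{3}\delta^2 - 2)$, which reproduces the minima at $\phi = \pm 1$ and the zero-energy maximum at $\phi = -\delta$; the function names `V` and `dV` and the parameter values are illustrative choices, not part of the model definition.

```python
import numpy as np

def V(phi, eps=1.0, delta=-0.5):
    """Asymmetric double-well potential (assumed form of Eq. (1))."""
    return eps * (phi + delta)**2 * (phi**2 - (2.0 / 3.0) * delta * phi
                                     + delta**2 / 3.0 - 2.0)

def dV(phi, eps=1.0, delta=-0.5):
    """Derivative dV/dphi = 4 eps (phi + delta)(phi^2 - 1)."""
    return 4.0 * eps * (phi + delta) * (phi**2 - 1.0)

eps, delta = 1.0, -0.5  # illustrative values

# Stationary points: the two minima at phi = +1, -1 and the maximum at phi = -delta.
for p in (1.0, -1.0, -delta):
    print("phi =", p, " dV/dphi =", dV(p, eps, delta), " V =", V(p, eps, delta))

# Energy difference between the minima, compared with (16/3) eps delta.
print("V(-1) - V(+1)    =", V(-1.0, eps, delta) - V(1.0, eps, delta))
print("(16/3) eps delta =", 16.0 / 3.0 * eps * delta)
```

For negative $\delta$ the $\phi = -1$ well is the lower one, so $\phi = +1$ plays the role of the metastable (unfolded) conformation.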

In order that solitons propagating through the protein backbone could mediate 
changes in its conformation, an interaction of an appropriate form should be 
introduced. Consider the following interaction potential: 

$$U(\theta,\phi) = \frac{\lambda}{1+\lambda}(1-\cos\theta)\,\phi^2, \tag{3}$$

where $\lambda$ is a positive parameter. This is a natural way to introduce an interaction
with the SG model (for example, interaction with an impurity in a long Josephson
transmission line [?]). Note that when $\theta = 0 \pmod{2\pi}$ there is no interaction. If
$\theta$ is nonzero (mod $2\pi$), then, at the minima, the energy of the combined $V$ plus
$U$ potential increases, which effectively lowers the barrier. We therefore expect
that a sufficiently energetic soliton may enable a transition from a metastable to
a stable conformation (even without thermal noise).
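The barrier-lowering effect can be illustrated with a short numerical sketch. It assumes the potential forms $V(\phi) = \epsilon(\phi+\delta)^2(\phi^2 - \tfrac{2}{3}\delta\phi + \tfrac{1}{3}\delta^2 - 2)$ and $U(\theta,\phi) = \frac{\lambda}{1+\lambda}(1-\cos\theta)\phi^2$; the helper names `W` and `barrier`, the grid, and the parameter values are hypothetical choices made only for illustration.

```python
import numpy as np

def W(phi, theta, eps=1.0, delta=-0.5, lam=0.05):
    """Local energy V(phi) + U(theta, phi) at a fixed value of theta (assumed forms)."""
    V = eps * (phi + delta)**2 * (phi**2 - (2.0 / 3.0) * delta * phi
                                  + delta**2 / 3.0 - 2.0)
    U = lam / (1.0 + lam) * (1.0 - np.cos(theta)) * phi**2
    return V + U

def barrier(theta, **params):
    """Barrier seen from the metastable well: W(local max) - W(higher local min).

    Returns 0.0 when the double-well structure has disappeared.
    """
    phi = np.linspace(-1.5, 1.5, 30001)
    w = W(phi, theta, **params)
    inner, left, right = w[1:-1], w[:-2], w[2:]
    minima = inner[(inner < left) & (inner <= right)]
    maxima = inner[(inner > left) & (inner >= right)]
    if len(minima) < 2 or len(maxima) < 1:
        return 0.0
    return maxima.max() - minima.max()

print("barrier at theta = 0 :", barrier(0.0))
print("barrier at theta = pi:", barrier(np.pi))
print("barrier at theta = pi, lam = 100:", barrier(np.pi, lam=100.0))
```

With a weak coupling the barrier is only reduced at the kink center ($\theta = \pi$), while for a strong coupling the metastable well can vanish there altogether, so that $\phi$ is free to slide towards the other conformation while the soliton passes.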
The full Lagrangian density reads: 

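$$\mathcal{L} = \frac{1}{2}\theta_t^2 - \frac{1}{2}\theta_x^2 - \frac{1-\cos\theta}{1+\lambda} + \frac{1}{2}m\phi_t^2 - V(\phi) - U(\theta,\phi). \tag{4}$$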

Note that we used a continuum limit for $\phi$. This may be a reasonable approximation
if the $\theta$ field changes slowly in space compared with the length of a peptide
unit. To make this model more realistic one may vary the parameters from point
to point along the molecular chain since they depend on the nature of the local
residue. Interaction between different points along the protein, due to electrostatic
and Van der Waals forces, hydrogen bonding, etc., can be introduced by
letting the parameters depend on the overall conformation (i.e. they should be
functionals of $\phi$). Nevertheless, to test our proposal for SMF, the simple model
will suffice.

From the Lagrangian (4) we get the equations of motion

$$\ddot\theta = \theta_{xx} - \frac{1+\lambda\phi^2}{1+\lambda}\sin\theta - \Gamma_\theta\dot\theta, \tag{5}$$

$$m\ddot\phi = -4\epsilon(\phi+\delta)(\phi^2-1) - \frac{2\lambda}{1+\lambda}(1-\cos\theta)\,\phi - \Gamma_\phi\dot\phi, \tag{6}$$

which also include dissipation terms. The scale was chosen so that when
$\phi$ is at a minimum, the coefficient of the $\sin\theta$ term is exactly 1.
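A minimal sketch of how such equations of motion might be integrated on a lattice is given below. It assumes the forms $\ddot\theta = \theta_{xx} - \frac{1+\lambda\phi^2}{1+\lambda}\sin\theta - \Gamma_\theta\dot\theta$ and $m\ddot\phi = -4\epsilon(\phi+\delta)(\phi^2-1) - \frac{2\lambda}{1+\lambda}(1-\cos\theta)\phi - \Gamma_\phi\dot\phi$, and uses a velocity-Verlet step with fixed boundary values of $\theta$. The scheme, the grid, the parameter dictionary keys and all function names are assumptions made for illustration; they are not the integration method used for the results reported in this paper.

```python
import numpy as np

def accelerations(theta, phi, theta_t, phi_t, dx, p):
    """Right-hand sides of the assumed equations of motion, divided by the inertia."""
    lap = np.zeros_like(theta)
    lap[1:-1] = (theta[2:] - 2.0 * theta[1:-1] + theta[:-2]) / dx**2
    a_theta = (lap - (1.0 + p["lam"] * phi**2) / (1.0 + p["lam"]) * np.sin(theta)
               - p["g_theta"] * theta_t)
    a_theta[0] = a_theta[-1] = 0.0  # theta is held fixed at both ends
    a_phi = (-4.0 * p["eps"] * (phi + p["delta"]) * (phi**2 - 1.0)
             - 2.0 * p["lam"] / (1.0 + p["lam"]) * (1.0 - np.cos(theta)) * phi
             - p["g_phi"] * phi_t) / p["m"]
    return a_theta, a_phi

def total_energy(theta, phi, theta_t, phi_t, dx, p):
    """Lattice energy whose gradient gives the conservative part of the forces."""
    d = p["delta"]
    V = p["eps"] * (phi + d)**2 * (phi**2 - 2.0 * d * phi / 3.0 + d**2 / 3.0 - 2.0)
    local = (0.5 * theta_t**2 + 0.5 * p["m"] * phi_t**2 + V
             + (1.0 + p["lam"] * phi**2) / (1.0 + p["lam"]) * (1.0 - np.cos(theta)))
    return dx * (local.sum() + 0.5 * (np.diff(theta)**2).sum() / dx**2)

def simulate(p, v=0.5, length=60.0, dx=0.1, dt=0.01, n_steps=1000):
    """Launch a kink f = 4 arctan(exp(-z)) into a chain with phi = 1 everywhere."""
    x = np.arange(-length / 2, length / 2 + dx / 2, dx)
    gam = 1.0 / np.sqrt(1.0 - v**2)
    z = gam * (x + length / 4)              # kink starts at x = -length/4
    theta = 4.0 * np.arctan(np.exp(-z))
    theta_t = 2.0 * gam * v / np.cosh(z)    # time derivative of the moving kink
    theta_t[0] = theta_t[-1] = 0.0
    phi, phi_t = np.ones_like(x), np.zeros_like(x)
    e0 = total_energy(theta, phi, theta_t, phi_t, dx, p)
    a_th, a_ph = accelerations(theta, phi, theta_t, phi_t, dx, p)
    for _ in range(n_steps):                # velocity-Verlet (kick, drift, kick)
        theta_t += 0.5 * dt * a_th
        phi_t += 0.5 * dt * a_ph
        theta += dt * theta_t
        phi += dt * phi_t
        a_th, a_ph = accelerations(theta, phi, theta_t, phi_t, dx, p)
        theta_t += 0.5 * dt * a_th
        phi_t += 0.5 * dt * a_ph
    return x, theta, phi, e0, total_energy(theta, phi, theta_t, phi_t, dx, p)

params = dict(eps=1.0, delta=-0.5, lam=1.0, m=10.0, g_theta=0.0, g_phi=0.0)
x, theta, phi, e0, e1 = simulate(params, n_steps=500)
print("energy before and after (no dissipation):", e0, e1)
```

Without dissipation the lattice energy should stay nearly constant, which provides a simple consistency check of the discretization; switching on `g_theta` and `g_phi` makes it decrease.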

First, let us consider the small interaction limit (we ignore dissipation at this 
stage). We choose 

$$\theta(x,t) = f(\gamma(x - vt)) + \Delta\theta(x,t),$$

$$\phi(x,t) = 1 + \Delta\phi(x,t), \tag{7}$$

where $f(z) = 4\tan^{-1}e^{-z}$ is the usual SG kink soliton moving with velocity $v$ ($\gamma$
is the relativistic factor). Assuming $\lambda$ is small and expanding to first order one
finds:

$$\Delta\ddot\phi + \omega^2\Delta\phi = -\frac{4\lambda}{m}\,\frac{1}{\cosh^2\gamma(x-vt)}, \tag{8}$$

with $\omega^2 = 8\epsilon(1+\delta)/m$. The solution is



$$\Delta\phi = -\frac{4\lambda}{\gamma v m\omega}\int_z^\infty dz'\,\frac{\sin\left[\frac{\omega}{\gamma v}(z'-z)\right]}{\cosh^2 z'}, \tag{9}$$

where $z = \gamma(x - vt)$. Asymptotically, for large $z$,

$$\Delta\phi \approx -\frac{4\lambda}{m}\,\frac{e^{-2z}}{(\gamma v)^2 + (\omega/2)^2}. \tag{10}$$

For $v = 0$ we can write an explicit solution

$$\Delta\phi = -\frac{4\lambda}{m\omega^2}\,\frac{1}{\cosh^2 x}. \tag{11}$$

We may compare this to the asymptotic result (10). Interpolation yields the
following approximate solitary wave solution:

$$\Delta\phi \approx -\frac{\lambda/m}{(\gamma v)^2 + (\omega/2)^2}\,\frac{1}{\cosh^2\gamma(x-vt)}. \tag{12}$$

As we can see, near the center of the $\theta$ kink, $\phi$ is pushed away from its local
minimum $\phi = 1$ towards the other local minimum, so we expect that for strong
enough interaction this would allow it to cross the energy barrier.
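The perturbative expressions above can be compared numerically. The sketch below assumes the first-order solution in the form $\Delta\phi = -\frac{4\lambda}{\gamma v m\omega}\int_z^\infty dz'\,\sin[\frac{\omega}{\gamma v}(z'-z)]/\cosh^2 z'$, its large-$z$ limit $-\frac{4\lambda}{m}e^{-2z}/[(\gamma v)^2+(\omega/2)^2]$, and the static profile $-\frac{4\lambda}{m\omega^2}/\cosh^2 x$; the function names, the trapezoidal quadrature and the parameter values are illustrative assumptions.

```python
import numpy as np

def omega(eps=1.0, delta=0.0, m=1.0):
    """Small-oscillation frequency around phi = 1: omega^2 = 8 eps (1 + delta) / m."""
    return np.sqrt(8.0 * eps * (1.0 + delta) / m)

def dphi_integral(z, v, lam=0.01, m=1.0, eps=1.0, delta=0.0, span=40.0, n=400001):
    """First-order response evaluated by trapezoidal quadrature of the integral form."""
    gam = 1.0 / np.sqrt(1.0 - v**2)
    w = omega(eps, delta, m)
    zp = np.linspace(z, z + span, n)
    f = np.sin(w / (gam * v) * (zp - z)) / np.cosh(zp)**2
    dz = zp[1] - zp[0]
    integral = dz * (f.sum() - 0.5 * (f[0] + f[-1]))
    return -4.0 * lam / (gam * v * m * w) * integral

def dphi_asymptotic(z, v, lam=0.01, m=1.0, eps=1.0, delta=0.0):
    """Large-z limit of the response."""
    gam = 1.0 / np.sqrt(1.0 - v**2)
    w = omega(eps, delta, m)
    return -(4.0 * lam / m) * np.exp(-2.0 * z) / ((gam * v)**2 + (w / 2.0)**2)

def dphi_static(x, lam=0.01, m=1.0, eps=1.0, delta=0.0):
    """Explicit static (v = 0) profile."""
    return -4.0 * lam / (m * omega(eps, delta, m)**2) / np.cosh(x)**2

print("z = 4, v = 0.5 :", dphi_integral(4.0, 0.5), dphi_asymptotic(4.0, 0.5))
print("z = 0, v = 0.05:", dphi_integral(0.0, 0.05), dphi_static(0.0))
```

The first line compares the integral with its asymptotic form ahead of the kink, and the second shows that for a slow kink the response approaches the static profile.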

In order to get further insight into the behavior of this model when the interaction
is strong, we set all time-derivative terms in (5) and (6) to zero, looking for a static
solution. The equations can be solved exactly in the symmetric ($\delta = 0$) case; we get
(note that $\phi = 0$ is not a stable solution)

$$\phi^2 = 1 - \frac{\lambda}{2\epsilon(1+\lambda)}(1-\cos\theta). \tag{13}$$

Inserting this into the equation for $\theta$ we have

$$\theta_{xx} = (1-a)\sin\theta + a\sin\theta\cos\theta, \tag{14}$$

with

$$a = \frac{\lambda^2}{2\epsilon(1+\lambda)^2}. \tag{15}$$

A localized solution is

$$\theta = 4\tan^{-1}\frac{1}{q+\sqrt{1+q^2}}, \qquad \phi^2 = 1 - \frac{\lambda}{\epsilon(1+\lambda)}\,\frac{1}{1+q^2}, \tag{16}$$

with $q = \sqrt{1-a}\,\sinh(x - x_0)$. We note that no localized solution exists for $a > 1$.
This is the strong interaction, low energy barrier regime, where the solitonic
behavior is totally destroyed. For small asymmetry, we may try to calculate the
first-order correction to the above result. Generally, introducing a perturbation $\Delta V_\phi$
in the static symmetric equation, it can be shown that the first-order correction
is given by

^-iioWl^- 10 W AnA (17) 



where $\theta_0$ and $\phi_0$ are the solutions (16) with $x_0 = 0$, and the Green function is
. if. . , (3a — l)asinhx + (3a + 1)(1 — a)xcoshx\ . , 

G(,)=^ ( i-.) 0n h,+i — 1+(1 : a) Jk — J- < ig ) 

Unfortunately, for most of the interesting perturbations, including the asymmetric
barrier perturbation

$$\Delta V_\phi = 4\epsilon\delta(\phi^2 - 1), \tag{19}$$

the integral (17) cannot be calculated analytically.
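The static symmetric solution can be verified numerically. The sketch below assumes the profile $\theta = 4\tan^{-1}[1/(q+\sqrt{1+q^2})]$ with $q = \sqrt{1-a}\sinh(x-x_0)$ and $a = \lambda^2/[2\epsilon(1+\lambda)^2]$, and checks it against $\theta_{xx} = (1-a)\sin\theta + a\sin\theta\cos\theta$ by finite differences. The function names and parameter values are illustrative, and the identity $q+\sqrt{1+q^2} = e^{\sinh^{-1}q}$ is used only for numerical stability.

```python
import numpy as np

def static_solution(x, eps=1.0, lam=1.0, x0=0.0):
    """Static kink of the symmetric (delta = 0) model; returns theta, phi^2 and a."""
    a = lam**2 / (2.0 * eps * (1.0 + lam)**2)
    if a >= 1.0:
        raise ValueError("no localized solution for a >= 1")
    q = np.sqrt(1.0 - a) * np.sinh(x - x0)
    # 1 / (q + sqrt(1 + q^2)) = exp(-arcsinh(q)), evaluated without cancellation
    theta = 4.0 * np.arctan(np.exp(-np.arcsinh(q)))
    phi2 = 1.0 - lam / (eps * (1.0 + lam)) / (1.0 + q**2)
    return theta, phi2, a

def max_residual(x, theta, a):
    """Largest deviation from theta_xx = (1 - a) sin(theta) + a sin(theta) cos(theta)."""
    dx = x[1] - x[0]
    theta_xx = (theta[2:] - 2.0 * theta[1:-1] + theta[:-2]) / dx**2
    t = theta[1:-1]
    rhs = (1.0 - a) * np.sin(t) + a * np.sin(t) * np.cos(t)
    return np.max(np.abs(theta_xx - rhs))

x = np.linspace(-6.0, 6.0, 12001)
theta, phi2, a = static_solution(x, eps=1.0, lam=1.0)
print("a =", a)
print("max residual of the static equation:", max_residual(x, theta, a))
print("minimum of phi^2 (at the kink center):", phi2.min())
```

The dip of $\phi^2$ at the kink center shows how far the conformation field is pulled towards the barrier top by a resting soliton.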

We now turn to the results of numerical simulations of the model.
When a topological constraint was enforced upon the $\theta$ field (a difference of $2\pi$
between the boundaries) and the system was allowed to relax via dissipation to a
static conformation, a result similar to that of equation (16) was observed. Next
we considered initial conditions of a moving $\theta$ kink, with $\phi = 1$ (local minimum) for
all $x$. Though it seems reasonable, based on the form of the interaction potential,
that a traveling soliton might transfer enough energy to the $\phi$ field to lift it over
the energy barrier, it is certainly not obvious a priori whether this energy would
be returned to the soliton. Note that an interacting soliton also radiates plasmons,
so it loses energy through this channel too (even without dissipation). It is useful
to define a collective coordinate and momentum for a soliton



$$Q = -\frac{1}{2\pi}\int_{-\infty}^{\infty} x\,\theta_x\,dx, \qquad P = -\int_{-\infty}^{\infty}\theta_x\theta_t\,dx. \tag{20}$$



These are in fact canonical variables which represent the soliton's 'center of mass'
and its conjugate momentum. The relation $\dot Q \approx P/E$ holds, with $E$ being the
total energy of the soliton (deviations from the exact formula are due to deformation
of the interacting soliton shape). We used $P$ as an indicator for the soliton's
kinetic energy. Simulations reveal that when $\delta = 0$ (i.e. symmetric barrier) the
kink slows down and stops after a short period of time. Next, $\delta$ is decreased
so that the $\phi = -1$ conformation becomes lower in energy. When the interaction
is strong enough, the kink transfers enough energy to the conformation angles,
allowing them to cross the potential barrier and reach the ground state ($\phi = -1$)
at practically every point it passes. However, the soliton extracts back some of the
conformation energy. After a short period of time it reaches a steady-state finite
velocity (up to some periodic fluctuations), when the energy gained balances the
energy lost. When dissipation is introduced, it lowers the steady-state velocity,
but the overall picture remains the same. Results of the simulation are shown in Fig. 1.
We conclude that within this model, a soliton is able to mediate folding (i.e.
carry conformational degrees of freedom over energy barriers) and in the process
compensate for energy losses by pumping energy back from the folded field.
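The use of the collective momentum as a velocity indicator can be illustrated for a free SG kink. The sketch below assumes the standard sine-Gordon definitions $Q = -\frac{1}{2\pi}\int x\,\theta_x\,dx$ and $P = -\int\theta_x\theta_t\,dx$ for a kink of the form $f = 4\tan^{-1}e^{-z}$; these definitions, the function names and the grid are assumptions made for illustration and need not coincide in every detail with the quantities used in the simulations.

```python
import numpy as np

def moving_kink(x, t, v, x0=0.0):
    """Free SG kink f = 4 arctan(exp(-z)), z = gamma (x - v t - x0), and its time derivative."""
    gam = 1.0 / np.sqrt(1.0 - v**2)
    z = gam * (x - v * t - x0)
    theta = 4.0 * np.arctan(np.exp(-z))
    theta_t = 2.0 * gam * v / np.cosh(z)
    return theta, theta_t

def collective(x, theta, theta_t):
    """Collective coordinate Q, momentum P and energy E of the theta field (assumed definitions)."""
    dx = x[1] - x[0]
    theta_x = np.gradient(theta, dx)
    Q = -np.sum(x * theta_x) * dx / (2.0 * np.pi)
    P = -np.sum(theta_x * theta_t) * dx
    E = np.sum(0.5 * theta_t**2 + 0.5 * theta_x**2 + 1.0 - np.cos(theta)) * dx
    return Q, P, E

x = np.linspace(-40.0, 40.0, 8001)
v = 0.3
for t in (0.0, 5.0):
    Q, P, E = collective(x, *moving_kink(x, t, v, x0=2.0))
    print("t =", t, " Q =", Q, " P/E =", P / E)
```

For the undeformed kink $P/E$ coincides with the velocity $v$ and $Q$ follows the kink center, which is why deviations from $\dot Q \approx P/E$ signal a deformation of the interacting soliton.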

In an actual unfolded state the local folding angles are supposed to be distributed
somehow between the $\alpha$ and $\beta$ conformations. We have checked the behavior of a
traveling kink with disordered initial conditions, namely, when the $\phi$ field is randomly
distributed between the two minima. Still, a steady-state velocity is reached,
though lower compared with the ordered case (see Fig. 2). This can be easily
explained since less compensating energy is available to the soliton when
half the conformational degrees of freedom are already at their ground state.
Nevertheless, the folding mediated is no less effective. For sufficient asymmetry all the
angles reach the lower energy. The soliton has lowered not only the energy but
also the entropy.

We have demonstrated, using a simple toy model, how a soliton propagating
along a protein backbone provides an effective mechanism for fast and deterministic
folding of the protein. By mediating the correlated folding of large sections of
the polypeptide chain, it significantly reduces the number of effective degrees of
freedom, since large structures may be created by a single soliton. Moreover,
these structures are created in a controlled process. It is possible that the amino
acid sequence dictates whether soliton propagation would be allowed or suppressed
in a specific location at a specific time; therefore the folding pathway
may not be stochastic but, rather, predetermined. The limiting rate of the folding
process would be the rate at which solitons are produced. Soliton creation is
probably induced spontaneously by thermal excitation. There might exist other
mechanisms for the creation of solitons, such as interaction with 'chaperones',
enzymes that catalyze protein folding. It is our belief that the concept of SMF will
improve scientific understanding of such phenomena.

The authors wish to thank Ziv Hermon for valuable discussions. This work
has been partly supported by a GIF grant.

FIG. 1. The collective momentum and the energy of a soliton moving through a
protein which is initially in a metastable (unfolded) state. The momentum is indicated
by the dashed line. The continuous line indicates the soliton energy. The energy of the
conformation field is shown in the small box. The parameters have the following values:
$\epsilon = 1$, $m = 10$, $\lambda = 100$, $\delta = -0.5$, $\Gamma_\theta = 0.1$, $\Gamma_\phi = 0.1$. Note that, although dissipation
is introduced, the soliton momentum has a constant positive average value.

FIG. 2. The collective momentum and the energy for disordered initial conditions.
The dissipation parameter $\Gamma_\theta$ was set to zero. All other parameters have the same values as
in Fig. 1. The protein conformation energy is shown in the small box.